skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Zhao, Jianshu"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract Understanding how populations respond to disturbances represents a major goal for microbial ecology. While several hypotheses have been advanced to explain microbial community compositional changes in response to disturbance, appropriate data to test these hypotheses is scarce, due to the challenges in delineating rare vs. abundant taxa and generalists vs. specialists, a prerequisite for testing the theories. Here, we operationally define these two key concepts by employing the patterns of coverage of a (target) genome by a metagenome to identify rare populations, and by borrowing the proportional similarity index from macroecology to identify generalists. We applied these concepts to time-series (field) metagenomes from the Piver’s Island Coastal Observatory to establish that coastal microbial communities are resilient to major perturbations such as tropical cyclones and (uncommon) cold or warm temperature events, in part due to the response of rare populations. Therefore, these results provide support for the insurance hypothesis [i.e. the rare biosphere has the buffering capacity to mitigate the effects of disturbance]. Additionally, generalists appear to contribute proportionally more than specialists to community adaptation to perturbations like warming, supporting the disturbance-specialization hypothesis [i.e. disturbance favors generalists]. Several of these findings were also observed in replicated laboratory mesocosms that aimed to simulate disturbances such as a rain-driven washout of microbial cells and a labile organic matter release from a phytoplankton bloom. Taken together, our results advance understanding of the mechanisms governing microbial population dynamics under changing environmental conditions and have implications for ecosystem modeling. 
    more » « less
  2. Abstract Surveys of microbial communities (metagenomics) or isolate genomes have revealed sequence-discrete species. That is, members of the same species show >95% average nucleotide identity (ANI) of shared genes among themselves vs. <83% ANI to members of other species while genome pairs showing between 83% and 95% ANI are comparatively rare. In these surveys, aquatic bacteria of the ubiquitous SAR11 clade (Class Alphaproteobacteria) are an outlier and often do not exhibit discrete species boundaries, suggesting the potential for alternate modes of genetic differentiation. To explore evolution in SAR11, we analyzed high-quality, single-cell amplified genomes, and companion metagenomes from an oxygen minimum zone in the Eastern Tropical Pacific Ocean, where the SAR11 make up ~20% of the total microbial community. Our results show that SAR11 do form several sequence-discrete species, but their ANI range of discreteness is shifted to lower identities between 86% and 91%, with intra-species ANI ranging between 91% and 100%. Measuring recent gene exchange among these genomes based on a recently developed methodology revealed higher frequency of homologous recombination within compared to between species that affects sequence evolution at least twice as much as diversifying point mutation across the genome. Recombination in SAR11 appears to be more promiscuous compared to other prokaryotic species, likely due to the deletion of universal genes involved in the mismatch repair, and has facilitated the spread of adaptive mutations within the species (gene sweeps), further promoting the high intraspecies diversity observed. Collectively, these results implicate rampant, genome-wide homologous recombination as the mechanism of cohesion for distinct SAR11 species. 
    more » « less
  3. Abstract Genome search and/or classification typically involves finding the best-match database (reference) genomes and has become increasingly challenging due to the growing number of available database genomes and the fact that traditional methods do not scale well with large databases. By combining k-mer hashing-based probabilistic data structures (i.e. ProbMinHash, SuperMinHash, Densified MinHash and SetSketch) to estimate genomic distance, with a graph based nearest neighbor search algorithm (Hierarchical Navigable Small World Graphs, or HNSW), we created a new data structure and developed an associated computer program, GSearch, that is orders of magnitude faster than alternative tools while maintaining high accuracy and low memory usage. For example, GSearch can search 8000 query genomes against all available microbial or viral genomes for their best matches (n = ∼318 000 or ∼3 000 000, respectively) within a few minutes on a personal laptop, using ∼6 GB of memory (2.5 GB via SetSketch). Notably, GSearch has an O(log(N)) time complexity and will scale well with billions of genomes based on a database splitting strategy. Further, GSearch implements a three-step search strategy depending on the degree of novelty of the query genomes to maximize specificity and sensitivity. Therefore, GSearch solves a major bottleneck of microbiome studies that require genome search and/or classification. 
    more » « less